Mon. Not. R. Astron. Soc. 000, 000-000 (0000) 



Printed 10 April 2008 



(MN WTtfX. style file v2.2) 



A Method to Measure the Mass of Damped Lya Absorber 
Host Galaxies Using Fluctuations in 21cm Emission 



J. Stuart B. Wyithe 

School of Physics, University of Melbourne, Parkville, Victoria, Australia 
oo Email: swyithe@unimelb.edu.au 

o : 

10 April 2008 



ABSTRACT 



Observations of damped Lya absorbers (DLA) indicate that the fraction of hydro- 
gen in its neutral form (HI) is significant by mass at all redshifts. This gas represents 
the reservoir of material that is available for star formation at late times. As a result, 
observational identification of the systems in which this neutral hydrogen resides is an 
important missing ingredient in models of galaxy formation. Precise identification of 
DLA host mass via traditional clustering studies is not practical owing to the small 
numbers of known systems being spread across sparsely distributed sight lines. How- 
ever following the completion of reionization, 21cm surface brightness fluctuations will 
be dominated by neutral hydrogen in DLAs. No individual DLAs could be detected in 
21cm emission. Rather, observations of these fluctuations will measure the combined 
clustering signal from all DLAs within a large volume. We show that measurement of 
the spherically averaged power-spectrum of 21cm intensity fluctuations due to DLAs 
could be used to measure the galaxy bias for DLA host galaxies when combined with 
an independent measurement of the cosmological HI mass density from quasar absorp- 
tion studies. Utilising this technique, the low frequency arrays now under construction 
could measure the characteristic DLA host mass with a statistical precision as low as 
0.3 dex at z > 4. In addition, high signal-to-noise observations of the peculiar-motion 
induced anisotropy of the power-spectrum would facilitate measurement of both the 
DLA host mass and the cosmic HI density directly from 21cm fluctuations. By ex- 
ploiting this anisotropy, a second generation of low frequency arrays with an order of 
magnitude increase in collecting area could measure the values of cosmic HI density 
and DLA host mass, with uncertainties of a few percent and a few tens of percent 
respectively. 

Key words: cosmology: diffuse radiation, large scale structure, theory - galaxies: 
high redshift, inter-galactic medium 



1 INTRODUCTION 

The primary avenue for study of the physical properties 
of the inter-galactic medium utilises the Lya absorption 
systems observed along the lines-of-sight towards high red- 
shift quasars. These absorbing systems include the Lya for- 
est (with column depth n < 10 17 cm -2 ), Ly-limit systems 
(10 17 cm~ 2 < n < 2x 10 20 cm -2 ), and damped Lyo absorbers 
(DLA, n > 2 x 10 20 cm~ 2 ). The latter are self-shielded and 
have been shown to host > 80% of the neutral hydrogen 
(HI) gas during most of cosmic time z < 5 (Prochaska et 
al. 2005). 

As the primary reservoirs of neutral gas, it is thought 
that DLAs provide the dominant sites for star formation, 
and represent the progenitors of modern galaxies (Woolfe, 



Gawiser & Prochaska 2005). Thus the identity of DLA hosts 
is crucial for our understanding of galaxy formation, and of 
the buildup of stellar mass in the Universe. DLA hosts have 
been identified in a few cases (e.g. Colbert & Malkan 2002; 
Warren et al. 2001), and observations of these (with some 
exceptions) are consistent with their being at the faint end 
of the Lyman break galaxy (LBG) population. On the other 
hand, these studies will more easily discover hosts in the 
most luminous systems, and could therefore be systemat- 
ically biased in their conclusions. Within the cold-dark- 
matter model the masses of objects may be determined in- 
dependently from their luminosity by studying their clus- 
tering properties and appealing to theoretical predictions 
of the bias relative to the underlying mass distribution. At 
high redshift z > 3, clustering analyses have been used in 
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this way to study the host halo masses of quasars (Shen et 
al. 2007; White, Martini & Cohn 2007), and LBGs (Adel- 
berger et al. 2005). However the numbers of DLAs known 
is small, occurring only once in every few quasar spectra, 
and so no sample of sufficient size to allow for an auto- 
correlation clustering analysis exists (Cooke et al. 2006). As 
a result a quantitative measurement of DLA host mass has 
remained illusive. Some progress has been made through 
comparison of clustering with the LBG population through 
cross-correlation. Cooke et al. (2006) have performed a sur- 
vey for LBGs in regions surrounding known DLA systems. 
This sample allows for measurement of the cross-correlation 
between DLAs and LBGs. Cook et al. (2006) find the cross- 
correlation length to be comparable to the correlation length 
(Adelberger et al. 2005) among the large LBG sample of 
Steidel et al. (2003), and so conclude that DLA systems 
have similar masses to LBGs. They argue for a host halo 
mass of 10 9 A/ Q < M < f0 12 M Q . This interpretation is con- 
sistent with modeling studies carried out in a cosmological 
framework (Nagamine et al. 2007). 

Alternatively, the presence of cold gas is also traced by 
Mgll absorption. Indeed, Mgll absorbers have been shown 
to be associated with neutral hydrogen absorbers over a 
range of column densities, including DLAs (e.g. Rao et 
al. 2006), and so Mgll absorption is used as a proxy to study 
the HI galaxy population at z < 1.5. The host halo masses 
of Mgll absorbers have been estimated at z ~ 0.5 via cross- 
correlation with luminous red galaxies in the Sloan Digital 
Sky Survey Data Release 3 (Bouche et al. 2006), yielding 
a host mass of M ~ 10 12 Mq. The large number of objects 
available (both Mgll absorbers and galaxies) yields a sta- 
tistical accuracy of a factor of 2 in host halo mass, much 
better than is available using comparable techniques for 
DLAs at higher redshift (Cook et al. 2006). In addition, host 
masses have been estimated via direct kinematic measure- 
ment in spatially resolved followup spectroscopy (Bouche et 
al. 2007), yielding estimates that are consistent with clus- 
tering results. 

The small number of known DLAs is due to the low den- 
sity of bright background sources. An alternative approach 
is therefore to observe the DLA population directly through 
21cm emission. There is an extensive literature describing 
the use of redshifted 21cm observations as a probe of the 
process of reionization in the high redshift IGM [see Furlan- 
etto, Oh & Briggs (2006) for an extensive discussion]. Reion- 
ization starts with ionized (HII) regions around galaxies, 
which later grow to surround groups of galaxies. The pro- 
cess of reionization is completed when these HII regions over- 
lap (defining the so-called overlap epoch) and fill-up most 
of the volume between galaxies. The conventional wisdom 
has been that the 21cm signal disappears after the overlap 
epoch, because there is little neutral hydrogen left through 
most of intergalactic space. However, the following simple 
estimate can be used to demonstrate that 21cm emission 
should be significant even after reionization is completed 
(Wyithe & Loeb 2008). As mentioned above, DLA systems 
are believed to contain the majority of HI at high redshifts. 
Indeed, observations of DLAs out to a redshift of z ~ 4 show 
the cosmological density parameter of HI to be Qhi ~ 10~ 3 
(Prochaska et al 2005). In the standard cosmological model 
the density parameter of baryons is Qb ~ 0.04, so that the 



mass-averaged neutral hydrogen fraction at z ~ 4 (long after 
the end of the HII overlap epoch) is xm = Sim/fib ~ 0.02. 

At 2 ~ 4, the brightness temperature contrast of red- 
shifted 21cm emission will be AT ~ 0.5mK. Moreover on 
the i2 ~ 10 co-moving Mpc scales relevant for upcoming 
21cm experiments, the root-mean- square amplitude of den- 
sity fluctuations at z ~ 4 is o ~ 0.2. Hence, we expect 
fluctuations in the 21cm intensity field due to DLAs to be 
at least ~ O.lmK on 10 comoving Mpc scales, with a boost 
in this signal if the DLAs are hosted by biased (i.e. massive) 
dark matter halos (Wyithe & Loeb 2008). The fluctuations 
in the 21cm emission signal after reionization are therefore 
expected to be only an order of magnitude or so smaller 
than the largest fluctuations predicted at any time during 
the entire reionization era (e.g. Wyithe & Morales 2007). 
Moreover the sky temperature, which provides the limiting 
factor in the system noise at the low frequencies relevant 
for 21cm studies, is proportional to (1 + z) 2,6 , and so is a 
factor of ~ 3.4[(1 + z)/5] 2,6 smaller at low-redshifts than for 
observations at z ~ 7. 

Several experiments are currently under development 
which aim to detect the 21cm signal during reionization 
(including MWA Q, LOFAR 0, PAPER 0, 21CMA 0) and 
more ambitious designs are being planned (SKA0). In ad- 
dition, the first statistical detection of 21cm fluctuations 
due to discrete, unresolved clumps of neutral gas was re- 
cently made (Pen et al. 2008) through cross-correlation of 
the HIPASS (Barnes et al. 2001) 21cm observations of the 
local universe with galaxies in the 6 degree field galaxy red- 
shift survey (Jones et al. 2004; 2005). This detection repre- 
sents an important step towards using 21cm surface bright- 
ness fluctuations to probe the neutral gas distribution in 
the IGM (both during and after reionization), as well as 
the mass power-spectrum (PS) and cosmology (McQuinn et 
al. 2006; Wyithe, Loeb & Geil 2008; Chang et al. 2007; Mao 
et al. 2008; Loeb & Wyithe 2008; Pritchard & Loeb 2008; 
Wyithe 2008). 

The first generation of low frequency radio arrays will 
have low spatial resolution, and insufficient sensitivity to 
detect individual DLAs in emission. On the other hand, 
the redshifted 21cm emission is sensitive to the total (mass- 
weighted) optical depth of this neutral gas. Observations of 
the redshifted 21cm signal would therefore detect the total 
neutral hydrogen content in a volume of IGM dictated by the 
observatory beam and frequency band-pass, and as a result 
each beam will contain the combined emission from a large 
number of DLAs. It is important to note that observation 
of DLAs via 21cm surface brightness fluctuations would be 
fundamentally different from quasar absorption line stud- 
ies. The surface brightness fluctuations would be sensitive 
to the statistical properties of the population (particularly 
the clustering), but could not be used to study individual 
DLAs. 

Although the 21cm emission after HII overlap is dom- 
inated by dense clumps of gas rather than by diffuse gas 
in the IGM as is the case before reionization is complete, 

1 http:/ /www. haystack. mit.edu/ast/arrays/mwa/ 

2 http://www.lofar.org/ 

3 http://astro.berkeley.edu/ dbacker/EoR/ 

4 http://web.phys.cmu.edu/ past/ 

5 http://www.skatelescope.org/ 
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we do not expect 21cm self absorption to impact the level of 
21cm emission. This conclusion is based on 21cm absorption 
studies towards DLAs at a range of redshifts between z ~ 
and z ~ 3.4, which show optical depths to absorption of the 
back-ground quasar flux with values less than a few percent 
(Kanekar & Chengalur 2003; Curran et al. 2007). The small 
optical depth for self absorption is also supported by theo- 
retical calculations of the 21cm optical depth of neutral gas 
in high redshift mini-halos (Furlanetto & Loeb 2002). More- 
over, DLAs have a spin temperature that is large relative 
to the temperature of the cosmic microwave background ra- 
diation, and will therefore have a level of emission that is 
independent of the kinetic gas temperature (e.g. Kanekar 
& Chengalur 2003). These factors combine to make predic- 
tion of the 21cm signal from DLAs robust against poorly 
understood astrophysical details of galaxy formation. 

In this paper we consider the PS of 21cm intensity fluc- 
tuations due to DLA systems at 2.5 < z < 5.5 where there 
are complementary quasar absorption line studies (§[2]). We 
show that the clustering could be accurately determined us- 
ing low-frequency radio telescopes currently under construc- 
tion (§[3]), and that this clustering will allow for an accurate 
measurement of the DLA mass (§ I4I5[) . We summarise our 
conclusions in §[S] Throughout the paper we adopt the set of 
cosmological parameters based in part of data from WMAP3 
(Spergel et al. 2007) for a flat ACDM universe. 



2 THE 21CM POWER-SPECTRUM OF DLAS 

We begin by describing the 21cm PS following the com- 
pletion of reionization. The amplitude of fluctuations in the 
21cm signal depends on the mass-averaged neutral hydrogen 
fraction in the IGM, and on the clustering bias of DLA sys- 
tems. The latter is related to the galaxy bias of the DLA host 
dark matter halos. If more massive halos tend to house more 
neutral hydrogen, we would expect there to be a mean rela- 
tion between DLA column density and the host halo mass. 
However DLA systems have a range of column densities, 
and so we would also expect significant scatter around this 
mean relation, both because of variation in the HI content 
from galaxy to galaxy, and because the column density for a 
particular galaxy will depend on the line-of-sight probed by 
a particular observation. In addition, there is a small con- 
tribution to the neutral hydrogen content of the IGM from 
Lyman limit systems and the Lya forest. 

For a DLA host with halo mass M, the galaxy bias 
b may be approximated using the Press-Schechter (1974) 
formalism (Mo & White 1996), modified to include non- 
spherical collapse (Sheth, Mo & Tormen 2001) 



b(M,z) = \ + 



i r a. 



+ bv 



/2(l-c) 



7V5 



v' 2 ° + b(l-c)(l-c/2) 



(1) 



where v = S 2 /a 2 (M), v' = y/av, a = 0.707, b = 0.5 and c = 
0.6. Here u(M) is the variance of the density field smoothed 
on a mass scale M at redshift z. This expression yields an 
accurate approximation to the halo bias determined from 
N-body simulations (Sheth, Mo & Tormen 2001). 

Our approach to computing the DLA PS is to assume 
that there is a probability distribution p(b)db for the neutral 



hydrogen mass- weighted clustering bias (b) of DLAs relative 
to the underlying dark matter. This distribution has a mean 
(b), and a characteristic mass where (6) = b(M^). The 
quantity (b) is the typical galaxy bias of halos hosting neu- 
tral hydrogen. Since DLAs dominate the neutral hydrogen 
mass, and are thought to reside in massive, biased halos, 
we refer to this typical bias in the 21cm PS as the bias of 
DLA hosts, and M/m as the DLA host mass throughout this 
paper. 

Our goal is to discuss the statistical precision with 
which Ma,) can be determined via the 21cm PS. As noted 
above, while most neutral hydrogen is located in the DLAs, 
a fraction of HI in the IGM is observed in Ly-limit systems 
with column densities below 2 x 10 20 cm -2 . If these systems 
are found in the same host halo population as the DLAs, 
then Mn,) will faithfully represent the DLA host mass. On 
the other hand, if Ly-limit systems are located in smaller less 
biased halos, then equation @ will lead to an underestimate 
of the DLA host mass. We do not expect this underestimate 
to be significant because the contribution to the observed PS 
is weighted by the the HI mass density squared and the bias 
squared, with DLA systems dominating both quantities. 

Assuming that the relation between HI and halo mass is 
independent of large scale over density (and hence ionizing 
background), the 21cm PS due to DLAs (Wyithe 2008) is 



P 2 i(fc) = 400mK 



l + z 
7.5 



x 2 m P(k) [(b) + ffl 



21 2 



(2) 



where xm = flm/^b is the mass weighted neutral frac- 
tion of hydrogen in the Universe. The PS is evaluated at 
a wavenumber k with modulus k and /i = cos 8, where 8 
is the angle between the line-of-sight and wave-number k. 
In deriving this expression we have assumed xm < 1, as 
is observed following the completion of reionization. The 
peculiar motion induced anisotropy is dependent on cos- 
mology through the evolution of growth factor D via / = 
dlog D/dlog (l + z), and follows from application of redshift 
space distortions (Kaiser 1987) to the 21cm PS. We assume 
/ = 1, which is appropriate at high redshift. From equa- 
tion © we also find the spherically averaged PS 



fti(fc) = 400mK 



l + z 
7.5 



2 

xm 



P(k). (3) 



We note that in the case where (b) = 1 (which corresponds 
to a uniformly ionized IGM), we have 



P 2 i(k) = 400mK 



l + z 
7.5 



[1.87] x 2 m P(k), 



(4) 



which includes the enhancement of fluctuations by the factor 
of ((l + fi 2 ) 2 ) = 1.87 owing to peculiar velocities (Barkana 
& Loeb 2005). Importantly, the PS is sensitive only to the 
first moment of p(b) . As a result, we do not need to calculate 
(or assume) a functional form for this distribution in order 
to estimate the constraints on M^) • 

Examples of spherically averaged power-spectra are 
plotted in Figure [T] (thick grey lines) assuming values of 
(b) corresponding to DLA halo masses of M (b) = 10 10 M Q , 
M (b) = 10 n M Q and M {b) = 10 12 Af Q (bottom to top), at 
each of the redshifts z = 2.5, 3.5, 4.5 and 5.5. We have 
assumed xhi = 0.02 in these models corresponding to obser- 
vations of HI density in DLAs at high redshift (Prochaska et 
al. 2005). The wiggles at wave numbers k ~ O.lMpc" 1 are 
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Figure 1. The spherically averaged PS of 21cm fluctuations due to DLAs after reionization (thick grey lines). We have plotted the PS in 
dimcnsionless form A = fc 3 P(k)/(2n 2 ). The model PS shown was computed assuming three values for the DLA mass, = 1O 1O M0, 

M( b ^ = 10 11 Mq and = 10 12 Mq. Also shown for comparison are estimates of the noise for MWA (thin solid lines) and a future 

extension MWA5000 (thin dashed lines). In each case we plot the uncertainty within fc-space bins of size Afc = fc/10, and have assumed 
observations for lOOOhrs in each of 3 different fields. The uncertainty includes a minimum fc cutoff due to foreground subtraction. At 
z < 3.5 only the MWA5000 is considered since the MWA antennae are not effective at frequencies greater than ~ 300MHz. The noise 
curves, which vary at large fc owing to Poisson noise, are shown only for = 10 12 Mq. 



the baryonic oscillations that are familiar to galaxy redshift 
surveys (Eisenstein et al. 2005). The models with larger DLA 
host mass yield fluctuations with significantly more power, 
owing to the larger galaxy bias. It is this variation in the 
power that enables measurement of the DLA host mass. 



3 SENSITIVITY TO FLUCTUATIONS IN 
21CM EMISSION FROM DLAS 

As an example of the sensitivity of forthcoming low- 
frequency telescopes to the fluctuations in 21cm emission 
after reionization, we have estimated the signal-to-noise for 
the Murchison-Widefield Array (MWA). Calculations of the 
sensitivity to the 21cm PS for an interferometer of this sort 
have been presented by a number of authors. We follow the 
procedure outlined by McQuinn et al. (2006), drawing on 
results from Bowman, Morales & Hewitt (2006) for the de- 
pendence of the array antenna density on radius p{r). The 
uncertainty in a measurement of the PS per mode in the sur- 
vey volume has two separate components, due to the ther- 
mal noise of the instrument (<SP2i,n), and due to sample 
variance within the finite volume of the survey (<5P2i,sv). 
Since we are dealing with discrete systems rather than a 



diffuse IGM, the latter includes a Poisson component due 
to the finite sampling of each mode and equals JPn.sv = 
Pn (fc) [1 + ((b) nDLA-P(fc)) - ], in which we approximate the 
number density of absorbing hosts by tidla ~ M dnsT /dM 
where dnsx/dM is the Sheth-Tormen (2002) mass function 
of dark matter halos. The noise due to sample variance there- 
fore depends both on the survey volume and on DLA host 
mass. We combine the above components to yield the un- 
certainty on the estimate of the PS within a fc-space volume 
element d 3 fc 

AP 21 = [JPai.sv + <5P 2 i,n] /VK- 

where the quantity N c = 2nk 2 sin 6dkd6[V / (2nf] de- 
notes the number of modes observed within d 3 k — 
2nk 2 sin(8)dkd9. In computing N c we assume symmetry 
about the polar angle and express the wave vector fc in 
components of its modulus fc and angle 8 relative to the 
line-of-sight. 

The contamination of foregrounds provides an addi- 
tional source of uncertainty in the estimate of the PS. Mc- 
Quinn et al. (2006) have shown that it should be possible 
to remove the power due to foregrounds to a level below 
the cosmological signal, provided that the region of band- 
pass from which the PS is estimated is substantially smaller 
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Figure 2. The integrated signal to noise for detection of the spherically averaged 21cm PS as a function of DLA mass. The cases of four 
different redshifts are shown for each of the MWA (left panel) and MWA5000 (right panel). 



than the total band-pass available. Following the approxi- 
mation suggested in McQuinn et al. (2006), we set N c = Q 
if 27r/fe cos(f?) > AD, where AD is the co-moving length 
corresponding to the line-of-sight distance over which fore- 
grounds can be removed. The number of modes observed 
depends on the volume of the survey, V = D 2 AD(A 2 /j4 t ii e ), 
where A t n c is the total physical surface area of an antenna, 
and D is the co-moving distance to the redshift of emission. 



3.1 Power-Spectrum Sensitivity of the MWA 

Estimates of the noise for detection of the spherically av- 
eraged PS for the MWA are plotted in Figure [1] (thin solid 
lines). Note that only redshifts z > 3.5 are accessible with 
the MWA antennae design. When complete, the MWA will 
comprise a phased array of 512 tiles (each tile will contain 16 
cross-dipoles) distributed over an area with diameter 1.5km. 
To compute the noise on the PS we model the antennae dis- 
tribution as having p(r) oc r~ 2 with a maximum radius of 
750m and a finite density core of radius 18m. We assume 
a lOOOhr integration on each of 3 fields, and a foreground 
removed bandpass of B = 8MHz within a total processed 
bandpass of 32MHz. 

The sensitivity to the 21cm PS is dependent on both 
the sensitivity of the telescope to a particular mode, and to 
the number of such modes in the survey. The former is set 
by the effective collecting area (A e ) of each antenna element 
(as well as the total number of antennae), while the latter 
is sensitive to the total physical area covered by each an- 
tenna (which we refer to as ^4tiie). This issue is discussed 
in more detail in Wyithe, Loeb & Geil (2008). In comput- 
ing the sensitivity we have assumed A c ~ 16(A 2 /4)m 2 and 
= 16m 2 (corresponding to the design of the MWA). 
The combined uncertainties include the minimum k cutoff 
due to foreground subtraction. Comparison of the noise es- 
timate with the expected 21cm signal shows that the MWA 
could detect the PS at 3.5 < z < 5.5 with high significance 
provided that the DLA hosts are sufficiently massive (and 
therefore biased). 



3.2 Power-Spectrum Sensitivity of the MWA5000 

At values of k ~ a few xlO" 1 Mpc -1 , the measurement of 
the PS using the MWA will be limited by the thermal sen- 
sitivity of the array, and so the signal-to-noise achievable in 
this regime will be greatly enhanced by a subsequent gen- 
eration of telescopes with larger collecting area. As an ex- 
ample we consider a hypothetical followup telescope to the 
MWA which would comprise 10 times the total collecting 
area. We refer to this followup telescope as the MWA5000. 
The design philosophy for the MWA5000 would be similar 
to the MWA, and we therefore assume antennae distributed 
as p(r) oc r~ 2 with a diameter of 2km and a flat density core 
of radius 80m (see McQuinn et al. 2006). For MWA5000 we 
assume the antennae design to be optimised at the redshift 
of observation (in which case we assume A c = A t n c )- In 
Figure [T] we present estimates for measurement uncertainty 
of the 21cm PS using MWA5000 (thin dashed lines). An 
MWA5000 would achieve a high signal-to-noise detection of 
the PS, even in cases where the DLA hosts have a small 
value of galaxy bias. 



3.3 Dependence of SN on DLA Host Mass 

In Figure [2] we show the integrated signal-to-noise ratio 
(SN) for detection of the spherically averaged PS as a func- 
tion of DLA host mass. The SN improves with increasing 
host mass owing to the increase in bias. Four different red- 
shift cases are shown for each of the MWA (left panel) and 
MWA5000 (right panel). Figure [2] quantifies the results de- 
scribed in Figure [1] The MWA can detect the spherically 
averaged PS at a signal-to-noise of ^ 3 for DLA masses 
M (b) > lO n M , while the MWA5000 could detect the PS 
at a signal-to-noise of 3 for DLA masses as small as 
A/( b ) ~ 1O 8 M0. In each case SN is limited at the high- 
est masses by the Poisson noise introduced into the PS by 
the discreteness of the DLA emission. 
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Figure 3. Constraints on DLA host mass and neutral fraction from a spherically averaged 21cm PS. The diagonal contours show the 
loci of points with likelihoods corresponding to 64% (solid lines) and 10% (dashed lines) of the maximum, given a true model with DLA 
masses of 10 12 M Q (upper panels) and 1O 1O M (1 ower panels) combined with a neutral fraction xjji ~ 0.02 and observation with the 
MWA. The vertical lines show the 64% and 10% likelihood contours for neutral fraction based on existing observations of DLAs within 
a redshift bin of width Sz = 0.5. 



4 MEASUREMENT OF DLA HOST MASS 
USING SPHERICALLY AVERAGED 21CM 
POWER SPECTRA 

In this section we consider the spherically averaged PS, 
which would be the observable of choice for 21cm PS obser- 
vations that achieve only modest SN. To estimate the po- 
tential for constraints on the DLA host mass properties from 
the spherically averaged PS, we first calculate the regions of 
parameter space p = (xhi, M/m) that are allowed around 
true models with p . We assume there is no uncertainty in 
the PS of matter fluctuations, which would increase the un- 
certainty somewhat by introducing distortions onto the PS 
via assumption of an incorrect cosmology (Wyithe 2008) . As 
part of this procedure, we construct likelihoods 

where the sum is over bins of k, and AP^ T is the spheri- 
cally averaged uncertainty on measurement of the 21cm PS. 
In Figure [3] we show the likelihood contours for combina- 
tions of xm and M/m = M({b)) given input models with 
M {b) = 1O 12 M (upper panels) and M (b) = 10 10 M Q (lower 
panels), combined with ^hi = 0.02 and observation with 
the MWA (diagonal sets of contours). The contours illus- 
trate the degeneracy between xm and Mn,\ in the resulting 
PS. Also shown is the existing constraint on the neutral 



fraction from quasar absorption line studies [vertical sets of 
contours showing the current uncertainty within Az — 0.5 
bins (Prochaska et al. 2005)]. The figure illustrates how the 
neutral fraction constraint can be used to break the degen- 
eracy. It is clear from Figure [3] that the constraint on xhi 
will be the limiting factor for measurement of the DLA host 
mass via the spherically averaged PS (in cases where the 
host bias is large). 

From equation (|3]) we see that measurement of the 
spherically averaged PS with a signal to noise .SiV yields 
a fractional error on the combination 6dlaXhi with value 

A(bpLAgHl) _ 1 

boLAXm ~ 2SAT [ ' 

where we have defined the quantity &dla = 
\J (b) 2 + 2/3 (6) + 1/5. In deriving equation (J6j) we have 
assumed the mass PS to have negligible uncertainty. The 
measurement of neutral fraction with uncertainty Axm 
is independent of the PS (having been obtained via the 
column density in DLAs along the line-of-sight towards 
high redshift quasars). The uncertainty in the characteristic 
bias (A&dla) is therefore 

A6 DLA = 6dla ,/(^^e) ) 2 + ( ^L)\ (7) 
© 0000 RAS, MNRAS 000, 000-000 
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Figure 4. The uncertainty (in dex) on the mass of the DLA host achievable using a spherically averaged 21cm PS. The cases of different 
redshifts are shown for each of the MWA (left panels) and MWA5000 (right panels). Upper panels: The fractional uncertainty in xjji was 
assumed to be Ai'hi/^hi = 0.2 (corresponding to one Sz = 0.5 bin in the SDSS study of Prochaska et al. 2005). Central panels: The 
fractional uncertainty in aim was assumed to be Axm/xm = 0.09 (corresponding to a combination of all five Sz = 0.5 bins in the SDSS 
study, or a future survey with smaller error-bars). Lower panels: The fractional uncertainty in xhi was assumed to be negligible. In the 
central and lower panels, the curves for Axhi/^hi = 0.2 are reproduced for comparison (thin grey curves). 



The corresponding uncertainty in (b) is 



A(b) 



d(b) 

d&DLA 



(b) + 1/3 



A6dla, 



yielding the uncertainty in host mass (in dex) 

d\ogM {b) 



A(log M (b) ) = 



d(b) 



-Mb), 



(8) 



(9) 



where ri(log M ^ )/d(b) is determined via equation |T]). 

The values of A(logM^)) that are obtained via equa- 
tion © represent the dispersion of a likelihood function 
£(log A'/ bs| log M(p) ) for observation of logA/ b s given a 
true value logM^. Following the observation of a 21cm 



DLA PS the likelihood function could then be used to ob- 
tain an a-posteriori measurement of the DLA host mass. 
Thus, in the instance of a flat prior probability distribu- 
tion for the logarithm of host mass, the dispersion approxi- 
mates the error (in dex) achievable for the mass of the DLA 
host given the true mass Mn,\ . The dispersion from equa- 
tion ([9]) is plotted in Figure [4] as a function of host mass, at 
a range of redshifts, and for different assumptions regard- 
ing the uncertainty in neutral fraction and properties of the 
21cm telescope used [the MWA (left panels) and MWA5000 
(right panels)]. The fractional uncertainty in xm was as- 
sumed to be Axm/xm = 0.2 [upper panels; corresponding 
to one Sz — 0.5 bin in the SDSS DR3 study of Prochaska 
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Figure 5. Constraints on the DLA host masses derived from measurement of the anisotropy of the 21cm PS. We assume observations 
using the MWA and 3 fields integrated for 1000 hours each. Results are shown at 2 redshifts, and for 3 values of the true host mass 
(M( b j = 1O 1O M0, 1Q 11 Mq and 10 12 Mq; from bottom to top). We assume a true value of xj^i = 0.02. In each case likelihood contours 
are shown at 10% and 64% of the maximum for the parameter set (ccm, ^(b) )■ Also shown are the corresponding likelihoods for . We 
have assumed a flat prior probability for Mn,) > and a Gaussian prior probability for the neutral fraction with a variance of Axjji = 0.2x'hi- 



et al. (2005)] and A^hi/xhi = 0.09 [central panels; corre- 
sponding to a combination of all five 8z = 0.5 bins in the 
Prochaska et al. (2005) study]. With current observations 
of DLA absorption systems, the host mass could be deter- 
mined to within a factor as small as 2-3, provided that the 
hosts are massive. 

It is apparent from equation (0 that the measurement 
precision of the DLA clustering bias (and therefore the DLA 
host mass) is limited by the uncertainty in neutral fraction 
once SN S> (i£Hi/A:r;Hi)/2 > 3 — 5. Thus, with current ob- 
servations of quasar absorption systems, the DLA host mass 
estimate using a spherically averaged PS would not be lim- 
ited by the PS measurement over a range of masses for the 
MWA. Moreover, the mass estimate would not be limited by 
the PS measurement at any host mass using the MWA5000. 
We therefore show additional results for the case where the 
error in measurement of (b) is dominated by the uncertainty 
in determination of the 21cm PS rather than the neutral 
fraction. The resulting values of A(log Ai^ ) are shown in 
the lower panels of Figure U The curves suggest that the 
achievable accuracy would be limited by the determination 
of Qm for host masses larger than ~ 1O 1O M0 using the 
MWA. In this most optimistic case the MWA5000 could 
measure the characteristic DLA host mass to the level of 
a few percent over the entire plausible mass range. 



5 MEASUREMENT OF THE DLA HOST MASS 
USING ANISOTROPY OF THE OBSERVED 
21CM POWER-SPECTRUM 

In the previous section we discussed the constraints that 
could be placed on the DLA host mass via a spherically 
averaged 21cm PS. That analysis broke the degeneracy be- 
tween mass averaged neutral fraction and the galaxy bias 
of DLA hosts using an independent measurement of neutral 
fraction from the HI column density in quasar absorption 
systems. In this section we use the peculiar velocity induced 
anisotropy of the observed PS to break the degeneracy be- 
tween bias and neutral fraction using the fact that these 
quantities contribute differently to the PS viewed at differ- 
ent angles relative to the line-of-sight (equation [2}. 

To estimate the potential for constraints on the DLA 
host mass, we again calculate the region of parameter space 
p — (asm) Mn,\ ) that is allowed around a true solution with 
p a . We assume there is no uncertainty in the PS of mat- 
ter fluctuations, which would increase the uncertainty some- 
what by introducing distortions onto the PS via assumption 
of an incorrect cosmology (Wyithe 2008). As part of this 
procedure, we construct likelihoods 

ln/ V^ _ 1 V ( PAT(fc,M,p)-Pj T (fc, M ,p ) \ 2 

2 tA AP AT (fc, M ) ) ' (W) 
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Figure 6. As per Figure[5] but assuming observation using the MWA5000, and no prior constraint on aim- 



where the sum is over bins of k and p. 



The results are shown in Figure [5l6l for the MWA and 
MWA5000 respectively using the observing strategy out- 
lined in § [4] In each case likelihood contours are shown for 
the parameter set (xm, M( b \) at two redshifts, z — 2.5 and 
3.5, assuming a true value of xm = 0.02 and three true 
values of host mass M {b) = 1O 1O M , 10 n M Q and 1O 12 M . 
Also shown are the corresponding likelihoods for M/m . We 
have assumed a flat prior probability for M/ b \ . The SN of 
the MWA is not sufficiently large to break the degeneracy 
between bias and neutral fraction. Therefore, as in the pre- 
vious section we impose a Gaussian prior probability on xhi, 
with a variance of Axm = 0.2iehi, corresponding to current 
uncertainty from quasar absorption line studies (Prochaska 
et al. 2005) . Figure [5] shows that at the sensitivity of the 
MWA, the constraints on the DLA host mass are very sim- 
ilar to those obtained from the spherically averaged PS. As 
a result nothing is gained by considering the angular de- 
pendence. On the other hand, the degeneracy is broken at 
the SN obtained by the MWA5000. As a result, we have 
not imposed a prior probability on the neutral fraction from 
quasar absorption studies in this case, and the allowed val- 
ues of xm in Figure [6] are derived directly from the 21cm 
PS, in addition to M {b) . Figure [6] shows that the MWA5000 
(Figure O would obtain accuracies of a few percent on the 
neutral fraction, and a few 10s of percent in the DLA host 
mass. 



6 CONCLUSIONS 

Observations of DLA systems out to redshift z ~ 5 show 
the density parameter of HI to be flm ~ 10~ 3 , indicat- 
ing that the mass averaged neutral hydrogen fraction re- 
mains at the level of a few percent through most of cos- 
mic history. These DLA systems will produce 21cm inten- 
sity fluctuations, whose power-spectrum has an amplitude 
that depends both on the total mass of neutral hydrogen 
within the DLA systems (f2m), and the masses of the DLA 
host halos (through galaxy bias). Since f2m is measured via 
quasar absorption line studies, we show that measurement 
of the spherically averaged 21cm power-spectrum amplitude 
at 2 < z < 5 could be used to determine the DLA host mass. 
Using a telescope of collecting area equal to the MWA, the 
DLA host mass could be determined to within a factor as 
small as ~ 2 (provided the hosts were massive galaxies), 
with the accuracy limited by the determination of fim in 
this case. 

In observations of the 21cm power-spectrum with high 
signal-to-noise, the limitation of the quasar absorption line 
derived neutral fraction could be removed by including the 
observed angular dependence due to peculiar motions in 
the analysis. In this case the measurement of DLA host 
mass would be limited by the sensitivity to the 21cm power- 
spectrum, and future telescopes with larger collecting area 
would significantly increase the precision (to within a few 
10s of percent) with which the DLA host mass could be 
determined. 

The DLA host mass is currently estimated via cluster- 
ing analyses of absorption systems which are very rare, only 
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being found in one of every few quasar lines of sight. Thus 
measurement of the 21cm power-spectrum, which measures 
the clustering of all DLAs in a large volume, has the poten- 
tial to greatly increase the precision with which the DLA 
host mass is known. Since DLAs are thought to host the 
majority of gas available for star formation, measurement of 
the DLA host mass will be a valuable contribution to our 
understanding of galaxy formation and the star formation 
history. 
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